Corpus: eng-gs_web_2014_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 49 61 70 74 75
1000 260 420 534 640 713
10000 1466 2889 3936 5243 6229
100000 17194 51442 69877 81255 87463
1000000 17194 51442 69877 81255 87463


Zipf's diagram for sentence endings


Gnuplot diagram

5253 msec needed at 2018-04-14 08:32